Recognizing protein substructure similarity using segmental threading.

Authors

  • Sitao Wu
  • Yang Zhang
Abstract

Protein template identification is essential to protein structure and function prediction. However, conventional whole-chain threading approaches often fail to recognize conserved substructure motifs when the target and templates do not share the same fold. We developed a new approach, SEGMER, for identifying protein substructure similarities by segmental threading. The target sequence is split into segments of two to four consecutive or nonconsecutive secondary structural elements, which are then threaded through the PDB to identify appropriate substructure motifs. SEGMER was tested on 144 nonredundant hard proteins. When SEGMER is combined with whole-chain threading, the TM-score of the alignments and the accuracy of the spatial restraints increase by 16% and 25%, respectively, compared with whole-chain threading alone. On 12 free modeling targets from CASP8, SEGMER increases the TM-score and contact accuracy by 28% and 48%, respectively. This significant improvement should have an important impact on protein structure modeling and functional inference.
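The segmentation step described in the abstract can be illustrated with a minimal sketch. It assumes the secondary structural elements (SSEs) of the target are already known and are identified only by their order along the chain; it then lists every group of two to four SSEs, consecutive or not. The function names `enumerate_segments` and `is_consecutive` are hypothetical, and the exhaustive enumeration is an assumption made for illustration: the abstract does not say how SEGMER selects nonconsecutive elements, and the threading step itself is not shown.

```python
from itertools import combinations
from typing import List, Tuple

Segment = Tuple[int, ...]  # indices of SSEs in chain order (hypothetical representation)


def enumerate_segments(n_sses: int, min_size: int = 2, max_size: int = 4) -> List[Segment]:
    """List every group of min_size..max_size SSEs, consecutive or nonconsecutive.

    Assumption: all combinations are kept; a real method may filter them.
    """
    segments: List[Segment] = []
    for size in range(min_size, max_size + 1):
        # combinations() yields index tuples in increasing order,
        # so the chain order of the elements is preserved.
        segments.extend(combinations(range(n_sses), size))
    return segments


def is_consecutive(segment: Segment) -> bool:
    """True if the SSEs in the segment are adjacent along the chain."""
    return all(b - a == 1 for a, b in zip(segment, segment[1:]))


if __name__ == "__main__":
    segs = enumerate_segments(5)
    consecutive = [s for s in segs if is_consecutive(s)]
    print(len(segs), len(consecutive))
```

For a target with five SSEs this yields 25 candidate segments, 9 of which are consecutive. The count grows quickly with the number of SSEs, which is one reason a practical method would likely restrict which nonconsecutive groups are threaded.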

Similar resources

Comparative Modeling of Mainly-Beta Proteins by Profile

The ability to predict structure from sequence is particularly important for toxins, virulence factors, allergens, cytokines, and other proteins of public health importance. Many such functions are represented in the parallel β-helix fold class. Structure prediction for this fold is a challenging computational problem because there exists very little sequence similarity (less than 15%) across th...

Unleashing the power of meta-threading for evolution/structure-based function inference of proteins

Protein threading is widely used in the prediction of protein structure and the subsequent functional annotation. Most threading approaches employ similar criteria for the template identification for use in both protein structure and function modeling. Using structure similarity alone might result in a high false positive rate in protein function inference, which suggests that selecting functio...

Fold Recognition Using Sequence Fingerprints of Protein Local Substructures

A protein local substructure (descriptor) is a set of several short non-overlapping fragments of the polypeptide chain. Each substructure describes the local environment of a particular residue and includes only those segments of the main chain that are located in the proximity of that residue. Similar descriptors from the representative set of proteins were analyzed to reveal links between the sub...

Raptor: Optimal Protein Threading by Linear Programming

This paper presents a novel linear programming approach to do protein 3-dimensional (3D) structure prediction via threading. Based on the contact map graph of the protein 3D structure template, the protein threading problem is formulated as a large scale integer programming (IP) problem. The IP formulation is then relaxed to a linear programming (LP) problem, and then solved by the canonical br...

Algorithms for Computing an Optimal Protein Threading with Profiles and Distance Restraints

Protein threading is one of the most powerful methods for protein structure prediction. Some advances have recently been made through the use of distance restraints. Xu et al. proposed a protein threading method in which distance constraints obtained from NMR experiments were taken into account [5]. Young et al. recently developed a novel experimental method to increase the predictive accur...

Journal:
  • Structure

Volume 18, Issue 7

Pages: -

Publication date: 2010